Locating topics in text processing

نویسنده

  • Eleni Miltsakaki
چکیده

In this paper we are concerned with the location of topics in text processing and the determination of the update unit in looking up topic continuations and topic shifts. Using key elements of the Centering Model of local discourse coherence and empirical evidence from Modern Greek and Japanese we argue that the appropriate update unit for topic tracking is the sentence in its traditional sense and not the nite clause, thus providing an account for the status of the subordinate clause in the calculation of topic transitions. We bring forth an argument from English, Modern Greek (MG) and Japanese for keeping topic and information structure distinct. We brie y discuss the signi cance of the current work to automated essay scoring and coreference-based summarization systems.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Recognition of Superimposed Caption

The automatic extraction and reading of news captions and annotations can be of great help locating topics of interest in digital news video archives. To achieve this goal, we present a technique, called Video OCR, which detects, extracts, and reads text areas in digital video data. In this paper, we address problems, describe the method by which Video OCR operates, and suggest applications for...

متن کامل

Multimodal dialogue segmentation with gesture post-processing

We investigate an automatic dialogue segmentation method using both verbal and non-verbal modalities. Dialogue contents are used for the initial segmentation of dialogue; then, gesture occurrences are used to remove the incorrect segment boundaries. A unique characteristic of our method is to use verbal and non-verbal information separately. We use a three-party dialogue that is rich in gesture...

متن کامل

Locating Presence and Positions in Online Focus Group Text with Stance-Shift Analysis

Social cues in online focus groups surface in the ways group members manipulate language, to signal their attitudinal shifts in position toward the group’s topics and what both moderators and members may have said. Their primary mode is task-based: their “job” is to respond to topics introduced by the focus group moderator; they also engage in “sidebar chat” among themselves. Using stance-shift...

متن کامل

A review of text mining approaches and their function in discovering and extracting a topic

Background and aim: Four text mining methods are examined and focused on understanding and identifying their properties and limitations in subject discovery. Methodology: The study is an analytical review of the literature of text mining and topic modeling.  Findings: LSA could be used to classify specific and unique topics in documents that address only a single topic. The other three text min...

متن کامل

بازشناسی متون فارسی با استفاده از مدل زبانی n-gram و پالایش گرامری

Abstract Text recognition has been one of the growing research topics in recent years. Many of these researches have focused on recognition of letters and sub-words as a basis for identifying larger text structures such as words, phrases and sentences. This thesis presents a new method in which the recognized sub-words are combined in order to provide meaningful words and sentences in Farsi tex...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999